Reliability of True Cutting Scores for Rasch Calibrated Items
Abstract
This paper provides formulas for the expected true-score measures and reliability of binary items as a function of their Rasch difficulty parameters when the trait distribution is normal or logistic. With the proposed formulas, one can evaluate the theoretical values of classical reliability indexes for norm-referenced and criterion-referenced interpretations without information about the raw scores or trait scores of persons from the target population. This is achieved by representing the theoretical (marginalized) values of the true-score components of reliability indexes as functions of the item difficulty parameter. Because the analytic forms of these functions are developed for individual items (and then "summarized" at the test level), one can know the population values of true-score measures and reliability for a set of Rasch calibrated binary items prior to their administration. An example of the application of the proposed formulas and their empirical validation is also provided. (Contains 2 tables, 2 figures, and 31 references.)
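The idea of obtaining population values from item difficulties alone can be illustrated numerically. The sketch below is not the paper's analytic formulas (the abstract does not reproduce them); it assumes a normal trait distribution and approximates the marginal quantities with Gauss-Hermite quadrature. The function names `rasch_prob` and `marginal_measures`, the default of 61 quadrature nodes, and the example difficulties are all hypothetical choices made for illustration.

```python
import numpy as np


def rasch_prob(theta, b):
    """Rasch probability of a correct response, shape (n_theta, n_items)."""
    return 1.0 / (1.0 + np.exp(-(theta[:, None] - b[None, :])))


def marginal_measures(b, mean=0.0, sd=1.0, n_nodes=61):
    """Marginal true-score measures for Rasch items under a normal trait.

    Assumption: quadrature stands in for the closed-form expressions
    that the paper derives; only item difficulties b are required.
    """
    b = np.asarray(b, dtype=float)
    # Nodes and weights for the weight function exp(-x^2 / 2).
    nodes, weights = np.polynomial.hermite_e.hermegauss(n_nodes)
    weights = weights / weights.sum()  # normalize to a probability mass
    theta = mean + sd * nodes

    p = rasch_prob(theta, b)  # item true scores at each node

    item_mean = weights @ p                   # expected item true score
    item_error_var = weights @ (p * (1 - p))  # expected item error variance

    tau = p.sum(axis=1)                       # test true score at each node
    tau_mean = weights @ tau
    true_var = weights @ (tau - tau_mean) ** 2
    error_var = item_error_var.sum()

    return {
        "item_mean": item_mean,
        "item_error_var": item_error_var,
        "true_var": true_var,
        "error_var": error_var,
        # Classical definition: true variance over observed variance.
        "reliability": true_var / (true_var + error_var),
    }


if __name__ == "__main__":
    # Hypothetical difficulties for a five-item test.
    result = marginal_measures([-1.0, -0.5, 0.0, 0.5, 1.0])
    print(result["item_mean"])
    print(result["reliability"])
```

The reliability here is the norm-referenced ratio of true-score variance to observed-score variance; criterion-referenced indexes would additionally need a cutting score, which this sketch leaves out.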
Similar resources
Reliability and true-score measures of binary items as a function of their Rasch difficulty parameter.
This article provides formulas for expected true-score measures and reliability of binary items as a function of their Rasch difficulty when the trait (ability) distribution is normal or logistic. The proposed formulas have theoretical value and can be useful in test development, score analysis, and simulation studies. Once the items are calibrated with the dichotomous Rasch model, one can esti...
Comparing traditional and Rasch analyses of the Mississippi PTSD Scale: revealing limitations of reverse-scored items.
This study examined whether Rasch analysis could provide more information than true score theory (TST) in determining the usefulness of reverse-scored items in the Mississippi Scale for Posttraumatic Stress Disorder (M-PTSD). Subjects were 803 individuals in inpatient PTSD units at 10 VA sites. TST indicated that the M-PTSD performed well and could be improved slightly by deleting one item. Fac...
Marginal True-Score Measures and Reliability for Binary Items as a Function of Their IRT Parameters
This article provides analytic evaluations of population true-score measures for binary items given their item response theory (IRT) calibration. Under the assumption of normal trait distribution, the expected values of marginalized true scores, error variance, true score variance, and reliability for norm-referenced and criterion-referenced interpretations are presented as a function of the it...
Application of Rasch Model in Evaluating the Reliability and Quality of Examination Paper for Object-oriented Design Course
Exams are widely used as an assessment tool to measure students' academic performance in most higher-education institutions in KSA. A good-quality set of constructed items/questions on the mid and final exams would be able to measure both students' academic performance and their cognitive skills. We adopt the Rasch Model to evaluate the reliability and quality of the first mid exam questions f...
Reliability in the Rasch model
This paper deals with the reliability of composite measurement consisting of true-false items obeying the Rasch model. A definition of reliability in the Rasch model is proposed and the connection to the classical definition of reliability is shown. As a modification of the classical estimator Cronbach’s alpha, a new estimator logistic alpha is proposed. Finally, the properties of the new estim...